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Abstract 

We study the functional link between the Hurst parameter and the Normalized Total 
Wavelet Entropy when analyzing fractional Brownian motion (fBm) time series — 
these series are synthetically generated. Both quantifiers are mainly used to identify 
fractional Brownian motion processes (Fractals 12 (2004) 223). The aim of this work 
is understand the differences in the information obtained from them, if any. 
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1 Introduction 



When studying the laser beam propagation through a laboratory-generated 
turbulence [1] we have introduced two quantifiers: the Hurst parameter, H, and 
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the Normalized Total Wavelet Entropy (NTWS), Swt- The former quantifier 
was introduced to test how good the family of fractional Brownian motion [2] 
(fBm) processes model the wandering of such laser beam, while the NTWS is 
a more general quantifier aimed to study any given dynamic system [3]. Also, 
in a recent work we have analyzed the dynamic case: the laboratory-generated 
turbulence was set up to change in time [4]. We have observed that these 
quantifiers are correlated, but at the time only a qualitative argument was 
given. Furthermore, each one of these quantifiers have been used separately 
to obtain information from biospeckle phenomenon [5, 6]. 

The fBm is the only one family of processes which are self-similar, with sta- 
tionary increments, and gaussian [7]. The normalized family of these gaussian 
processes, B H , is the one with B H (0) = almost surely, ~E[B H (t)] = 0, and 
covariance 



E 



B H (t)B H (s)\ =\{\t\ 2H + \s\ 2H - \t - s\ 2H ) , (1) 



for s,tGl. Here E[-] refers to the average with gaussian probability density. 
The power exponent H has a bounded range between and 1. These processes 
exhibit memory, as can be observed from Eq. (1), for any Hurst paremeter but 
H = 1/2. In this case successive Brownian motion increments are as likely to 
have the same sign as the opposite, and thus there is no correlation. Oth- 
erwise, it is the Brownian motion that splits the family of fBm processes in 
two. When H > 1/2 the correlations of successive increments decay hyperboli- 
cally, and this sub-family of processes have long-memory. Besides, consecutive 
increments tend to have the same sign, these processes are persistent. For 
H < 1/2, the correlations of the increments also decay but exponentially, and 
this sub-family presents short-memory. But since consecutive increments are 
more likely to have opposite signs, it is said that these are anti-persitent. 

The Wavelet Analysis is one of the most useful tools when dealing with data 
samples. Thus, any signal can be descomposed by using a diadic discrete family 
{2^ 2 ip(2H — k)} — an orthonormal basis for L 2 (Kf) — of translations and scaling 
functions based on a function ip: the mother wavelet. This wavelet expansion 
has associated wavelet coefficients given by Cj(k) = (S,2^ 2 ip(2^ ■ —k)). Each 
resolution level j has an associated energy given by Sj = E|Cj(fc)r. If the 
signal has stationary increments the coefficients are independent on k and 
then the relative wavelet energy, RWE, is 

Pi = (2) 



with j G {— N, . . . , — 1}, where N = log 2 M is the number of sample points 



and Stat = J27=-n ^ \Cj\ is the total energy. Thus the NTWS is defined as 



2 



(see [1] and references therein) 

Swt = - E Pj ■ log 2Pj /S max , with S max = log 2 N. (3) 

j=-N 



For a signal originated from a fBm the energy per resolution level can be 
calculated using the formalism introduced in Ref. [8], see Appendix 3, 

E \C 3 {k)\ 2 = 2T(2H + 1) sin(7rtf)2-^ +1 ) (4) 

for any mother wavelet election satisfying f R ijj = 0. From (4) the relative 
wavelet energy for a finite data sample is 

1 _ 0-(l+2ff) 

which becomes independent on wavelet basis. And so it does the normalized 
total wavelet entropy, 



N 



2i+2H _ i 2 N ( 1+2H ) — 1 



log 2 



logo N 



1 _ 2-( 1 + M ) 
1 _ 2~ N ( 1 + 2H ) 



■ (6) 



As it was expected the entropy decreases when H increases, with H measuring 
the level of order of the signal. 



2 Simulations and tests 



To test the functional relation between the Hurst exponent and NTWS we 
have simulated 50 fractional Brownian motion data samples [9] for each H G 
{0.1,0.2,0.3,0.4,0.5,0.6,0.7,0.8,0.9}. Since we have examined data of 5000 
point in length in Ref. [1], these samples are set to the same length. For 
each set we estimate H and S'wT- Moreover, we employ an orthogonal cubic 
spline function as mother wavelet. Among several alternatives, the cubic spline 
function is symmetric and combine in a suitable proportion smoothness with 
numerical advantages. It has become a recommendable tool for representing 
natural signals [10, 11]. 

The Hurst parameter is estimated as usual: by plotting the logarithm of the 
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estimated energy per resolution level j, 

i Nj 

iy j k=i 

with Nj = 2 _J M the number of coefficients at resolution level j, versus j and 
fitting a minimum square line. The slope of the line is the desired estimator. 

For NTWS we start dividing the signal into Nt non-overlapping temporal win- 
dows with length L, Nt = M/L. The wavelet energy estimator at resolution 
level j for the time window i is given by 

Sf = ^ E C?(*), withi = l,..-,JVr, (8) 

where iV} represents the number of wavelet coefficients at resolution level j 
corresponding to the time window i; while the total energy estimator in this 

?(*) _ p(* 

■'tot ~~ ^j<0 °j 



time window will be S^t = Exo^'' 



Hence, the time evolution estimators of RWE and NTWS will be given by: 

if = sf/m, (9) 



s WT ^ = -J2Pf } -^g 2 p ( f/s—. (io) 

j<0 



In order to obtain a quantifier for the whole time period under analysis [3] the 
temporal average is evaluated. The temporal average of NTWS is given by 

1 Nt ^ ■ 

(Swt) = T7- E &WT> (11) 
7V T i=l 

and for the wavelet energy at resolution level j 

then the total wavelet energy temporal average is defined as (£ to t) = J2j<o(Sj)- 
In consequence, a mean probability distribution {qj} representative for the 
whole time interval (the complete signal) can be defined as 

qj = (Sj) /(Su^, (13) 
with Yjj Qj — 1 an d f ne corresponding mean NTWS as 

Swr = -'Eqj-1oB2<b/S BU *. (14) 

j<0 
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In Figure 1 we compare H against its estimator. It has a good performance for 
0.3 < H < 0.9 and fails outside. Furthermore, the estimators fits better the 
larger values. Figure 2 represents the temporal average of NTWS, (Swt), and 
the mean NTWS, Swt, estimated with our procedure and compared against 
the theoretical result in eq. (6) with N = 12. As usual, boxplots [12] show 
lower and upper lines at the lower quartile (25th percentile of the sample) and 
upper quartile (75th percentile of the sample) and the line in the middle of 
the box is the sample median. The whiskers are lines extending from each end 
of the box indicating the extent of the rest of the sample. Outliers are marked 
by plus signs. These points may be considered the result of a data entry error 
or a poor measurement. 



3 Conclusions 



For a fBm we have found, eq. (6), there is an inverse dependence: as H grows 
the temporal average, (Swt), an d mean NTWS, Swt, diminishes. It is verified 
with the synthetic fBm data samples. This relation is logical, the spectrum has 
less high-frequency components as H gets higher and all the energy is closer 
to the origin, and, if H gets lower the energy contribution at high frequencies 
becomes relevant. Observe that the closer H is to the exact value, the better 
are the results for both estimators of the entropy. 

From an analytical point of view both H and Swt are equivalent. Although, 
the NTWS also contains information about the extension of the data set. 
Nevertheless, from a computational point of view the latter is independent 
on the scaling region, making the entropy less subjective. On the other hand 
the logarithm in the entropy definition introduces important errors, as we see 
in Figure 2. To narrow these it is necessary to increase the data samples. It 
should be stressed that extending the length of the data samples reduces the 
statistical error. 
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APPENDIX A 



Let us take as the signal S(t) = B H (t,u>), u is fixed and represents one 
element of the statistic ensemble and it will be omitted hereafter. The wavelet 
coefficients are calculated using the orthonormal wavelet basis \2~^ 2, ^)(2r^ ■ 

Cf(k) = [ 2- j/2 tlj(2- J s-k)B H (s)ds = 2 {1/2+H)j [ tlj(s)B H (s + k)ds, (15) 

JR JR 

for the last step we used the self-similar property of the fBm; that is, B H {t) = 
c H B H (c~ l t). Since the fBm can be written, using the chaos expansion de- 
scribed in Ref. [8], as 

oo 

B H (t) = Y;(M H l M ,t n )H en (u), 

n=l 

where {£ n }neN are the Hermite functions, and the operator M# is defined as 
follow [13] 

M3>(y) = c h \v\ 1/2 - h $(v), (16) 

where the hat stands for the Fourier transform, c 2 H = T(2H + 1) sin(7rif ), and 
<fi is any function in L 2 (IR). Then, we introduce the following coefficients 

d*{k)= f (M H l [0)a+k] ,^(s)ds, (17) 

to finally obtain: 

oo 

Cf (k) = 2(W £ d%(k)H en (cu). (18) 

n=l 

The evaluation of the coefficients d^(k) is straightforward from their definition 
and eq. (16): 

dn(k) = ~l sgnu \v\-U 2+H ^(v)Uv)e- iku dv, (19) 
where ^(v) = Tp(v). 

The chaos expansion in eq. (18) corresponds to a Gaussian process, then for 
integers j, k,j f , k' the correlation is equal to [14] 

oo 

= E2 (1/2+H)a+/) ^(fc)^(fc')- 

n=l 

Now, from eq. (19) and orthogonality of the Hermite functions the above 



E 



Cf{k)Cf{k' 
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equation is rewritten in the following way 



E 



Cf (k)CF (A/)l = c&2<V3+*)Ci+i') f \u\^ 1+2 ^\^(u)\ 2 e-^ k - k '>du, (20) 



which is the usual expresion found in many works, see [15] and references 
therein. The integral has convergence problems near the origin. These are 
resolved chosing a mother wavelet <fi with K null moments. That is, 



Jm. 



dt = 0, 



for k = 0, • • • , K — 1. Therefore, |\l/(z/)| 2 = a\ \vf K + 0,2 \u\ 2K+1 + o(|i/| 2A ' +1 ). 
When k and fc' are far apart, i. e., m = k — k' —* 00, the integral in eq. (20) is 
dominated by the contribution of frequencies in the interval [0, 1], thus giving 



E 



Cf (k)Cf (Jfc')l « 2a 1 c|2( 1 / 2+ - ff )^+^') / z/ 



,2/iT-2//-l 



2a 1 42 (1/2+H)(j+J " ) r(2K - 2H) cos(tt(A' - H))m 

+ 0{m 2 



cos(mis) dv 

2K-2H 
2K-2H-1 



), (21) 



for K > H. The coefficients of a wavelet expansion are highly correlated. But , 
for j = j' and k = k', 



E 



Cf{k) 2 = c 2 H 2^ +2H ^ [ 



-(l+2iJ) 



= 2r(2F + 1) sin(7riJ)2 (1+2 ^ / ^ 1+2 ") I^MI 2 d^, (22) 

we recover the mean energy by resolution level j. Therefore, the RWE is 
obtained replacing the above into eq. (2): 



2.7(1+2//) 



Pj 



J2;L N y {1+2H) 



1 _ 2-( 1 + 2 ^) 
1 _ 2-JV(i+2tf) 



2 (j+l)(l+2//)_ 



(23) 



where the last equation comes from the evaluation of the geometric series 
corresponding to the total energy. Its logarithm (base 2) is simply log 2 Pj = 
(1 + 2H)(J + 1) + log 2 [l - 2"( 1+2H )/1 - 2~ N ^ +2H \ Finally, using these results 
in the definition of NTWS, it is 



S WT (N,H) 



1 _ 2~( 1 + 2 ^) 
1 _ 2-^(1+2/0 

1 _ 2~( 1 + 2 ^) 
1 _ 2-^(1+2//) 



;i + 2H) J2 U + 1)2^' +1 )( 1+2H ) 

j=-N 



log 2 



1 _ 2"( 1 + 2 ^) 
1 _ 2-^(1+2//) 



£ 2 0'+i)(i+2/0 ; (24) 



then handling the geometric sums carefully we obtain eq. (6). 
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Fig. 1. The estimator for the Hurst parameter is plotted against the expected value, 
the dashed line is the identity. 
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Fig. 2. (Swt) (top) and SwT (bottom) are compared against Swt(12, H) (dashed 
line) as denned through eq. (6). 
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